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ABSTRACT 

Microwave, submillimetre-wave, and far-infrared phased arrays are of considerable importance for astronomy. We 
consider the behaviour imaging phased arrays and interferometric phased arrays from a functional perspective, 
^vq , It is shown that the average powers, field correlations, power fluctuations, and correlations between power 
_ ■ fluctuations at the output ports of an imaging or interferometric phased array can be found once the synthesised 
reception patterns are known. The reception patterns do not have to be orthogonal or even linearly independent. 
jjy ■ It is shown that the operation of phased arrays is intimately related to the mathematical theory of frames, and 
that the theory of frames can be used to determine the degree to which any class of intensity or field distribution 
can be reconstructed unambiguously from the complex amplitudes of the travelling waves at the output ports. 
\ The theory can be used to set up a likelihood function that can, through Fisher information, be used to determine 
t-H ■ the degree to which a phased array can be used to recover the parameters of a parameterised source. For example, 
it would be possible to explore the way in which a system, perhaps interferometric, might observe two widely 
■ separated regions of the sky simultaneously. 
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There is considerable interest in developing phased arrays for radio astronomy. Projects include the Square 
Kilometer Array (SKA), the Low Frequency Array (LOFAR), the Electronic Multibeam Radio Astronomy Con- 
cept (EMBRACE), and the Karoo Array Telescope (KAT). 1-3 All of these projects are aimed constructing 
phased arrays for microwave astronomy, but as technological capability improves, phased arrays will eventually 
■ be constructed for submillimetre-wave and far-infrared astronomy. 4,5 

Two types of phased array are of interest: (i) imaging phased arrays, where an array of coherent receivers 
is connected to a beam-forming network such that synthesised beams can be created and swept across the 
sky; (ii) interferometric phased arrays, where the individual antennas of an aperture synthesis interferometer are 
equipped with phased arrays such that fringes are formed within the synthesised beams. In this way it is possible 
to extend the field of view, to observe completely different regions of the sky simultaneously, to steer the field 
of view electronically, and to observe spatial frequencies that are not available to an interferometer because the 
baselines cannot be made smaller than the diameters of the individual antennas. 

It is important to recognise that the synthesised beams of a phased array need not be orthogonal, and may 
even be linearly dependent. Non-orthogonality may be built into a system intentionally as a way of increasing 
the fidelity with which an image can be reconstructed, or it may arise inadvertently as a consequence of RF 
coupling and post-processing cross-talk. In some situations, say in the case of interacting planar antennas, it 
may not even be clear how to distinguish one basis antenna from another, even before the beam-forming network 
has been connected. 

In this paper, we show that the only information that is needed to determine the average powers, the 
correlations between the complex travelling wave amplitudes, the fluctuations in power, and the correlations 
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Figure 1. A generic phased array having M horns and P ports. 

between the fluctuations in power at the output ports of a phased array, or between the output ports of phased 
arrays on different antennas, is the synthesised beams, ft is not necessary to know anything about the internal 
construction of the arrays or the beam forming networks. Beam patterns may be taken from electromagnetic 
simulations or experimental data, fn the case of interferometric phased arrays, the arrays on the individual 
antennas do not have to be the same. 

The ability to assess the behaviour of a system simply from the synthesised beam patterns separates the 
process of choosing the best beams for a given application from the process of understanding how to realise 
the beams in practice. It also suggests important techniques for simulating phased arrays, and for analysing 
experimental data. 

2. BASIC PRINCIPLES 

In practice, an imaging phased array comprises a sequence of optical components, an array of single-mode 
antennas, and an electrical beam-forming network such that each output port corresponds to a synthesised 
reception pattern on the input reference surface, usually the sky. In some cases, the synthesised reception 
patterns may be static and designed to give optimum sampling on a given class of object, whereas in other cases, 
the beam-forming network may be controlled electrically to generate a set of synthesised beams that can be 
swept across the field of view. In the case of microwave astronomy, the optical system would be a telescope, the 
single-mode antennas would be the horns or planar antennas of an array of HEMT amplifiers or SIS mixers, and 
the beam-forming network would be a system of microwave or digital electronics. 

Our analysis is based on the generic system shown in Fig. I. A denotes the input reference surface, B the 
output ports of the horns, and C the output ports of the beam-forming network. We shall assume that an 
array of M horns is connected to a beam-forming network having P output ports. Each of the P ports is thus 
associated with a reception pattern on the input reference surface. For simplicity, we shall assume paraxial optics 
throughput. When a pseudomonochromatic field, x(r), is incident on the system, a set of travelling waves will 
appear at B: we shall denote their complex amplitudes by {y m : m G 1, • • • , M}. We shall use the notion of 
complex analytic signals throughout, which for most practical purposes means that one can integrate the final 
result over some bandwidth to calculate general behaviour. Likewise, a set of travelling waves will appear at 
C: we shall denote their complex amplitudes by {z p : p e 1, • • • , P}. When M and P are finite, the complex 
amplitudes can be assembled into column vectors y G C M and z G C p , respectively. 

In what follows, it will sometimes be beneficial to represent the primary variables as abstract vectors. Because 
the incoming field, x(r), is square integrable over the input reference surface, A, it can be represented by a vector 
|x) in Hilbert space H. The input surface may extend to infinity, or it may be bounded by an aperture, and 
therefore of finite extent. Regions having different shapes and sizes correspond to different Hilbert spaces, y 
and z can also be represented by abstract vectors, |y) G £ 2 and |z) G t 2 respectively, where I 2 is the space 
of square-summable sequences. These definitions lead to two operators, one of which, H : H — > I 2 , maps the 




incoming optical field onto the outputs of the horns, and the other 3? : t 2 — * £ 2 maps the outputs of the horns onto 
the outputs of the beam-forming network. These individual operators can be combined into a single composite 
operator T = <&H : H — > I 2 , which describes the system as a whole. 

It can be shown, Appendix A, using only the concepts of inner product, operators, and adjoints in Hilbcrt 
space, that the complex travelling- wave amplitude appearing at port p, when a field, x(r), is incident on a system 
is given by 

z p = [ t;(r)-x(r)d 2 r, (1) 
J A 

where t p (r) is the functional form of the p'th synthesised reception pattern. A corresponds to the surface and 
region over which the Hilbert space is defined. In expressions such as (1) we shall show the the complex conjugate 
explicitly, even though some notation includes it in the dot product, as an inner product, implicitly. The reason 
for the formality in stating, and indeed deriving (1), is that (1) can be shown to be true even when the beam 
patterns arc not orthogonal. 

The synthesised reception patterns are central to what follows because, according to (1), the complex travelling 
wave appearing at port p is given by calculating the inner product, over the input reference surface, between the 
synthesised reception pattern t p (r) and the incoming field. It would be naive to assume, however, that when a 
system is illuminated by a field having the form t p (r) , a travelling wave only appears at p . In the case of phased 
arrays, the synthesised reception patterns do not have to be orthogonal, and can even be linearly dependent. 
Thus, although the output at a given port is given by the inner product between a field and a reception pattern, 
as for orinary antennas, one cannot assume that there is a one-to-one mapping between the antenna patterns 
and the ports. 

For example, in the case of Fig. 1, the beam patterns of the horns, h m (r), are orthogonal, and the outputs 
of the horns, y m , are given by 

y m = [ h^(r)-x(r)d 2 r, (2) 

J A 

but the beam- forming network is described by a linear operator €>, and therefore 

z p = ^ ' 'PpmUm- (3) 
m 

Substituting (3) in (2) we find 

z v = / y^pmh^r) • x(r) d 2 r, (4) 
which can be cast into the form of (1) by defining 

t P (r)=]>> pm b4(r). (5) 

m 

As expected, the synthesised reception patterns are merely weighted linear combinations of the horn patterns. 
The orthogonality of the synthesised reception patterns can now be tested through 

/ t;(r).V(r)d 2 r = ^0 pm 0;, m , (6) 

where (5) has been used, together with the orthonormality of the horn patterns. 

In the case where the numbers of horns and ports are finite, (6) takes the form of a matrix equation: 

/ t;(r)-V(r)d 2 r = **t. (7) 

J A 



Because <J> is mapping between C M and C p , 3? is under complete if P > M, and is singular; contrariwise, 
$ is over complete if P < M, and is not singular. In both cases, except trivially when certain ports are 



not connected, the synthesised reception patterns are not orthogonal, because <&<I>^ ^ Ip is not diagonal. In the 
case where <& is unitary, 3><l>t — Ip, where Ip is the identity operator of dimension P, the synthesised reception 
patterns are orthogonal. Butler beam forming networks are used in practice to realise this situation. 

In summary, the complex travelling-wave amplitudes appearing at the output ports of a phased array are 
found by calculating the inner products of the incoming field with respect to a set of synthesised reception 
patterns, but the synthesised reception patterns do not have to be orthogonal. Even if a system is designed to 
have orthogonal beams, practical issues relating to coupling and cross talk will cause the beam patterns to be 
non-orthogonal at some level. One would, therefore, like to derive an analysis procedure based on the beam 
patterns alone, where it is not necessary to know anything about the internal construction of the array. For our 
purposes, we shall assume that the behaviour of all phased arrays is described by (1) regardless of whether it is 
known how the arrays are constructed or not. 

In many cases we are interested in using phased arrays to image incoherent or partially coherent fields — in 
the context of astronomy, although the field on the sky is usually incoherent, the input reference plane, as far 
as the phased array is concerned, may be internal to the optics of the telescope. To this end, it is convenient to 
introduce correlation dyadics. We shall define the correlation dyadic of the incident field according to 

X(r',r) = (x(r)x*(r')}, (8) 

where ( ) denotes the ensemble average, and x(r) is interpreted as a complex analytic signal. The tensor X(r', r) 
contains complete information about the correlation between the fields at any two points and in any two polari- 
sations. Once the correlation dyadic is known, all classical measures of coherence follow. 

The correlation between the travelling wave amplitudes at any two ports can be written (z p z*,), or in matrix 
form 

Z = zzt, (9) 
where Z G C PxP is a correlation matrix. The matrix elements of Z can be found by using (1): 

Z PP > = f ( t;(r) • l(r', r) • V (O d 2 r d 2 r'. (10) 
J A J A 

Now illuminate the system with an unpolarised, spatially fully incoherent source 

X(r',r) =I<5(r - r'), (11) 
where I is the dyadic identity operator. Substituting (11) in (10), wc find 

Z vv , = [ t;(r)-V(r)d 2 r, (12) 
J A 

which shows that, because the synthesised reception patterns are generally not orthogonal, the travelling waves 
at the output ports are correlated. Ultimately, it is these correlations that prevent one from extracting more 
and more information from a source, using a finite number of horns, by synthesising more and more beams. 

3. FRAMES 

In what follows, we shall need to make use of the mathematical theory of frames. Suppose for the moment 
we have some general monochromatic field |x), and that we determine the inner products with respect to a 
set of basis vectors T = {|t p ), p G 1, ■ • • , P}: z p = (t p |x). P can extend to infinity, and we do not make any 
assumptions about the orthonormality or linear independence of T. Under what circumstances can the original 
vector |x), which represents a continuous function, be recovered unambiguously from a discrete set of complex 
coefficients, possibly countable, and how can this be achieved? In the context of phased arrays, we are essentially 
asking under what circumstances can the form of an incident field be recovered from the outputs of an array, 
when the synthesised beams are possibly non-orthogonal and linearly dependent. 



Evaluate the square moduli of the inner products between T and any general vector, |x), and sum the results. 
If there are two constants A and B such that < A < oo and < B < oo, and 

^||x|| 2 <||f|x)|| 2 <B || xf, (13) 

which can also be written 

A||xf< ^T|<t p |x> H | 2 < 5||x|| 2 , (14) 

p 

V |x) G H, then the basis set T is called a frame with respect to H. Notice the use of strict inequalities in the 
allowable values of A and B. In the case where A w B, the frame is called a 'tight frame' because the inner 
products for all |x) G H lie within some small range, and the dynamic range needed for inversion is small. When 
the original basis is orthonormal, the frame bounds, A and B, are equal, as can be appreciated by inserting 
x) = \t p i) in (14). If the frame is over complete, but normalised, A is a measure of the redundancy in the frame. 

If a basis set constitutes a frame, then it can be shown, through (13) alone, that T is injective, one-to-one, 
but not surjective, onto: T maps H onto a subspace of £ 2 , or when P is finite, a subspace of C p . Consequently, 
T has a left inverse, T" 1 , such that |x) = T _1 T|x) : Vx G H. t" 1 maps the image of T, Im[T], back onto H, 
and maps the null complement of Im[T], ImfT]- 1 -, onto the zero vector in H. The inverse operator is given by 

f- 1 = (ftf ) 1 f t =S- 1 f t : (15) 

S is bijective such that S _1 exists. 

It can also be shown that T _1 satisfies the frame condition 

illzf^llf^lz)!! 2 ^ ^Ilzf. (16) 

Thus, the more tightly bound the frame, the more tightly bound the inverse, and the more stable the reconstruc- 
tion process. 

The reconstruction of the original field through T" 1 can be best implemented by the introduction of dual 
vectors. The dual vectors |t p ) of any given frame T, with respect to Hilbcrt space H, can be derived through 

|t p ) = S- 1 |t p ). (17) 

The dual basis set, which we shall call T = {|t p ), p G 1, • • • ,P}, has the same degree of completeness as the 
original frame, T, and therefore it too constitutes a frame with respect to H. Indeed, two representations of any 
general |x) are possible: 

|x> = 5> p |x)|t p ) (18) 

p 

|x) - ]T(t p |x) |t p >. 

p 

If one calculates a set of coefficients by taking the inner products with a frame, then one inverts the process by 
reconstructing the field using the dual vectors. Alternatively, if one calculates the inner products with respect 
to the dual vectors, then one inverts the process by reconstructing the field using the frame. In the case where 
the basis vectors are perfectly complete with jespect to H, but not necessarily orthogonal, the basis is called a 
Riesz basis, and the basis set T and dual set T are then biorthogonal: (t p |t p /) = 5 PP < : Vp,p' G 1, • • • , P. 

In the case where the basis vectors do not constitute a frame, that is to say they are under complete with 
respect to H, then one can go through the same procedure as before, but now, when an attempt is made to 
reconstruct the original field vector, by using the duals, 

|x') = ]T<t p |x>|t p }, (19) 

p 



the reconstructed vector |x'} cannot, for all vectors in H, be the same as the original vector |x). It can be shown, 
straightforwardly, that the error vector |x) — |x'} is orthogonal to the basis vectors; in other words, the solution 
is as close as possible within the degrees of freedom available. |x') is the orthogonal projection of |x) onto §, the 
subspace spanned by the under complete set of basis vectors. In the context of functions, the reconstructed field 
is a least-square fit to the original field. The same conclusion is reached if the inner products arc taken with the 
dual vectors of an incomplete basis, and the field reconstructed using the original vectors. 

The relevance to phased arrays is clear, one can measure the complex outputs of a phased array, and if 
the reception patterns constitute a frame with respect to the Hilbert space defined by the shape, extent, and 
illumination of the input reference surface, then the continuous incoming field can be reconstructed completely 
from the discrete set of outputs. If the reception patterns do not constitute a frame, reconstruction leads to the 
least square fit that is consistent with the degrees of freedom to which the phased array is sensitive. If the field is 
a spatially fully incoherent source, the number of degrees of freedom in the field is infinite, even if the field only 
extends over a finite region, and an infinite number of horns is needed to realise a frame. In reality, however, all 
optical fields only contain a finite number of degrees of freedom, and therefore frames are, at least in principle, 
possible. 



4. MATRIX REPRESENTATIONS 

The theory of frames is intimately related to the operation of phased arrays. Suppose, for example, that we 
wish to describe the behaviour of a phased array by means of a scattering matrix that relates, for any incoming 
field, the P reception-pattern coupling coefficients to the P output ports. One such representation is simply the 
P x P identity matrix, Ip, because the outputs are given by the coupling coefficients in any case, but such a 
representation does not correctly represent the throughput of the system if the number of horns is less than the 
number of ports, because there are fewer degrees of freedom in the calculated coefficients than the number of 
coefficients. The identity matrix does not contain any information about the physics of the array, which becomes 
apparent when one comes to consider internally generated noise. 

A better approach is as follows. Using the concept of frames, we can generate a set of coupling coefficients 
by representing the field in terms of the duals of the reception patterns. From (1) 



/ t;(r) • Y, v V ( r ) d " r = E R pp ,z 'p' ■ 



(20) 



where 



z' p ,= f t;(r)-x(r)rf 2 r. (21) 

J A 



I A 

Alternatively, according to (18), we may represent the field in terms of the reception patterns themselves: 



/ t;(r).^^ V (r)d 2 r = ^i?; p ,z;,. 

J A „/ „/ 



(22) 



where 



Zp, 



> p ,= / t;,(r)-x(r)d 2 r. (23) 

J A 

In (20) and (22), R pp i and R' pp , are both P x P scattering matrices, which are equally good at describing the 
behaviour of the array. Unlike the identity matrix, however, they can only transmit the same number of degrees 
of freedom as the array itself regardless of whether the beam patterns constitute a frame with respect to the 
incoming field or not. They also lead to the appropriate correlations for internally generated noise, as will be 
shown later. 

Now consider the situation were an optical system is placed in front of the phased array described by (22). 
We wish to describe the behaviour of the optical system itself in terms of a scattering matrix. Moreover, we wish 
to use the synthesised reception patterns as the basis set on the output side of the optical system, which we shall 
now call T2 = {|t2, P 2}, p2 S {1, • • • , P2}, and some other basis set on the input side of the optical system, which 



we shall call Ti = {|ti jP i), pi G {1, • • • , -PI}. T2 does not have to be a frame with respect to all possible field 
distributions that can appear at the output, say H2, because we are only interested in those fields to which the 
array can couple. Ti does, however, have to be a frame with respect to all possible field distributions that can 
appear at the input, because we are not sure how incoming fields will scatter. If we choose to use the synthesised 
reception patterns as the basis for the input reference surface, we must supplement the set with the complement 
of T2 relative to Hi. Indeed, through this process we can define a virtual array whose beams are T2 H [T2] x . We 
shall not develop this idea here. 

The behaviour of the optical system can be described by 



x 2 (r 2 ) = / N(r 2 , ri )-x(ri)d 2 ri, (24) 

J Si 

where, x 2 (r 2 ) and xi(ri) are the fields on the input and output sides, respectively, and again, paraxial optics is 
assumed. Now we can use the dual frame of Ti, Ti say, to generate a set of expansion coefficients on the input 
side: 

2pi = / tt p i(ri)-xi(n)d 2 n. (25) 

JSl 

and then (24) becomes 

x 2 (r 2 )= f N(r 2 ,ri)-5^o p iti lP i(ri)d 2 ri. (26) 

J Si p i 

We can also express the output field in terms of a set of coefficients 



Substituting (26) in (27) we find 



b P 2= / t* p2 (r 2 ).x 2 (r 2 )d 2 r 2 . (27) 
Js 2 



bp2 = ^2 l M p2p \a p i, (28) 
pi 



where the matrix elements are given by 

M p2pl = [ / t* p2 (r 2 ) -N(r 2 ,n) •ti, p i(n)d 2 rirf 2 r 2 . (29) 
J Si Js 2 

(29) is an operator, which is a matrix for finite dimensional spaces, that maps the field coefficients on the input 
side onto the field coefficients on the output side. The operator describes the process of reconstructing the field 
in the space domain, scattering in the space domain, and then projecting the scattered field onto the output 
basis set. 

If we assume finite dimensionality for all surfaces, and that the output frame of one optical component is used 
as the input frame of the next optical component, then we can cascade a number of components, I, according to 

M = ]jM\ (30) 
1 

where the {M 4 : i G 1 • • • , 7} are the scattering matrices of the individual components. The last component, M J , 
could be the phased array itself, R' in (22), giving a description of the system as a whole. 

Earlier, we showed that it is possible to describe the behaviour of a phased array in terms of the duals of 
the reception patterns, rather than the reception patterns themselves. Equally, we can use either frames or dual 
frames on the input and output reference surfaces of an optical component to generate a variety of scattering 
matrices, each of which describes the behaviour of the component equally well. Moreover, we can choose whether 
to use frames or dual frames, or a mixture, in the definition of the correlation dyadics, thereby generating a variety 
of equally good ways of describing correlations. 10 When representing the process of scattering a partially coherent 
field through an optical component, the correlation dyadics should be chosen to match the bases used for the 
scattering matrices themselves. 



5. IMAGING PHASED ARRAYS 



5.1. Imaging field distributions 

It is not possible to construct a phased array that forms a frame with respect to any undefined complex function, 
even over a finite-sized region, because an infinite number of individual horns would be needed. In reality, 
however, optical fields have finite dimensionality, and frames become feasible. Often, a phased array will be 
placed on the back of an optical system, and the role of the phased array is to collect as much of the information 
that appears at the output of the optical system as possible. We now consider whether the outputs of a given 
phased array form a frame with respect to any field that can pass through a preceding optical system. 

We have shown previously 11 that the behaviour of paraxial optical systems is best described using the 
Hilbcrt- Schmidt decomposition of the operator that projects the field at the input reference surface onto the 
output reference surface: N(r 2 ,ri) in (24). A Hilbert-Schmidt decomposition is needed because optical systems 
generally map fields between different Hilbert spaces, and therefore eigenfunctions are not suitable for describing 
behaviour. 

Thus, the dyadic Green's function in (24) becomes 

N(r 2 ,r 1 ) = 2<7 < u i (r 2 K(n). (31) 

i 

After substituting (31) into (24) it becomes clear that the process of scattering a field through an optical system 
consists of projecting the incoming field onto the input eigenfields Vj(ri), scaling by the singular values Ui, and 
reconstructing the outgoing field through the outgoing eigenfields Uj(r 2 ). It is also clear, and an intrinsic feature 
of the Hilbert Schmidt decomposition, that the field, possibly partially coherent, at the output reference surface 
has only a limited number of degrees of freedom. In the context of (31), the Hilbert Schmidt decomposition has 
only a finite number of singular values that are significantly different from zero. 

What we require is for the synthesiscd reception patterns of our phased array to create a frame with respect 
to the vector space spanned by the Ui(r 2 ) having singular values significantly different from zero: say Hilbert 
subspacc §. In this case the frame is finite, and could, in principle at least, be realised by a finite number of 
horns. How do we determine whether the synthesised reception patterns constitute a frame with respect to §? 

Suppose that |x 2 ) is some general vector in the Hilbert space H 2 at the output reference surface of an optical 
system. S corresponds to that subspace of H 2 spanned by the output eigenfields having singular values greater 
than some threshold value, say a TO j„ > e. In other words, S contains, for all practical purposes, any information 
that could have been transmitted through the optical system. The set of output eigenfields having singular values 
greater than e is {|iij) : i £ 1, • • • , /}, where / < oo, because the throughput of the optical system is finite. Now 
suppose that we have some other set of vectors {|t p ) : 1, • • • , P}, and wish to determine whether {|t p ) : 1, • • • , P} 
constitutes a frame with respect to §. That is to say, if we determine the complex coupling coefficients between 
the |t p ) and any vector in S, can we recover the vector in § without ambiguity? 

If |x 2 ) is some general vector in §, then the frame condition reads 

A || x 2 f< ]T |(t p |x 2 }| 2 < B || x 2 || 2 V|x 2 ) e S, (32) 
v 

or, assuming that |x 2 ) has been normalised 

A<^|(t p |x 2 )| 2 <B V|x 2 )eS. (33) 

p 



For a given set of vectors |t p (r)}, the inner products can be written explicitly, such that (33) takes the form 



t;(r)-x 2 (r)ci 2 r 



<B Vx 2 (r)e§. 



(34) 



We can, however, describe x 2 (r) completely in terms of the output eigenficlds 

x 2 (r) = ^ajUi(r). (35) 



Substituting (35) into (34) gives 



where 



Expanding (36), we get 



where 



<B VaeC 7 , 



E, 



Epid. 

i 

= I t;(r). Ul (r)d 2 r. 

J A 



A<J2 a *' a i R i'i< B VaeC 7 , 

ii' 

Ri'i = y^ j E Vv E pi . 



(36) 



(37) 



(38) 



(39) 



Or, because the number of basis functions I is finite, R can be written as R = E^E, where the elements of E 
correspond to the overlap integrals between the output eigenfields and the synthesised reception patterns: (37). 
Although, the final relationship expresses a mapping of a finite dimensional space onto itself, the mapping passes 
through a space having infinite dimensions and therefore the integral in (37) should be evaluated analytically if 
at all possible. The frame condition (38) then becomes 



A < a f Ra < B = A < a^Ea < B Va e C J . 



(40) 



In order to establish whether {t p : 1, • • • , P} constitutes a frame with respect to the output eigenfields having 
non-zero singular values, we need to determine the limits A and B by rotating a throughout C 7 . Another way of 
thinking about the same problem is that we have some general a, and we wish to determine whether it always be 
described in terms of the vector space spanned by the set of vectors z p , corresponding to the set of all possible 
measurements, given the mapping E. 

The operator R = E^E is Hermitian, and can be diagonalised: 

R = WAW t . (41) 

The frame condition then becomes 

A < a^WAW'a < B VaeC 1 . (42) 

The middle term of (42) takes on its maximum value when the vector a corresponds to the eigenvector of W 
having the largest eigenvalue: remembering that a must have unit length and therefore can only be rotated. If 
W is degenerate in the largest eigenvalue, there is a range of vectors that lead to a maximum, but the outcome 
is still that B — \ m ax- Likewise, (42) takes on its minimum value when the vector a corresponds the the 
eigenvector having the smallest singular value, A = A mi „. If the smallest singular value is zero, R is singular, 
and {|t p ) : 1, • • • , P} does not constitute a frame with respect to S. 

The operator R = E^E simply maps the eigenfield coefficients of the optical system onto the output ports of 
the array and then back again onto the eigenfield coefficients. If the set of basis vectors T do not span all possible 
vectors in S, cither because there are too few of them, or because they do not span the same space, information 
is lost when the frame coefficients are calculated, and the frame is incomplete. It is not possible, therefore, to 
recover complete information about the output field of the optical system from the outputs of the phased array. 
In this case, recovering a with the dual vectors, and then reconstructing the field using the eigenfields, will give 



the best least square approximation to the field. In reality, because of the presence of noise a Bayesian method 
would probably be used to reconstruct the field. 

For infinite-dimensional frames, P — > oo, we can use the same procedure, but now we must calculate the 
eigenvalues of the matrix R, where 

Ri>i= f [ u*,(r)-f(r',r).u,(r')d 2 rdV, (43) 

J A J A 

where 

T(r',r)=£t p (r)t p (r'), (44) 
p 

and the sum over p extends to infinity. Again, these integrals should be evaluated analytically. Clearly, in the 
case where the frame is complete and orthonormal t p (r)t*(r') = 15 (r — r'), giving R = I, supporting the 
validity of the result. 

We now have a measure of how effectively a phased array can image a complex field; it is easy to show that 
when a phased array forms a frame with respect to a fully coherent field at a surface, then it is also possible to 
recover completely the spatial correlations of a partially coherent field at the same surface: essentially because 
the natural modes of the partially coherent field lie within the same Hilbcrt subspacc 8. 

According to (18), in the infinite-dimensional case, 

z P = [ t;(r).x(r)d 2 r (45) 

J A 

x(r) = ^%,tp(r), 

p 

which describes the recovery of a coherent field. Incidently, (45) also shows that J2 p t p (r)t*(r) = I5(r) for an 
over complete or perfectly complete frame. Forming the correlation matrix Z and the correlation dyadic x(r', r), 
using (45), we get 

Z pp < = 11 t;(r).X(r',r).V(r')rf 2 rrfV (46) 

J A J A 

f ( r '> r ) = J2 z pp' l p( r K'( r '^ 

pp' 

which describes the recovery of the spatial correlations of a field from measurements of the cross correlations 
between the outputs of a phased array, using the dual beams. (46) confirms that the correlations of a field can 
also be recovered, if the reception patterns constitute a frame. 

5.2. Imaging intensity distributions 

The previous section describes a calculation that can be performed to find out whether the synthesised reception 
patterns of a phased array form a frame with respect to the output eigcnficlds of an optical system. This 
procedure must be used when one is interested in recovering phase information from the field. Often, however, 
in the case of simple imaging, one is only interested in being able to recover the intensity distribution of a fully 
incoherent source. In this case, certain of the beam patterns needed to form a frame may be created by scanning 
the array physically across the source. It seems, however, that different frames are needed depending on whether 
one is trying to preserve phase or whether one is just interested in measuring intensity: we should distinguish 
between 'field frames' and 'intensity frames'. 

To this end, assume that the source is fully incoherent and unpolarised, but that the intensity varies from 
position to position. The correlation dyadic of the source then becomes 



x(r',r)=I/(r)<5(r-r')., 



(47) 



where J(r) is the intensity as a function of position. Substituting (47) into (10) gives 



Z vv , = [ J(r)t;(r)-V(r)d 2 r. (48) 

J A 

But say that we only measure the diagonal elements of Z through the use of total power detectors, then 

Z pp = f 7(r)fc p (r)d 2 r, (49) 



A 

where k p (r) = t*(r) • t p (r). Thus, for an incoherent source, the output powers of the individual ports of a phased 
array are related to the intensity distribution of the source through a set of inner products with the functions 
Wr):p€l,-,P}. 

If the goal is to reconstruct the intensity distribution of a source, one could ask whether the basis {k p (r) : p e 
1, ■ • • , P} forms a frame with respect to the Hilbert space defining the range of possible intensity distributions. 
There is a problem, however, because in assuming that the source field is spatially incoherent, we assumed that 
the intensity is a member of an infinite dimensional space. To answer the question as to whether the phased 
array is suitable for recovering intensity, we must define more clearly the vector space of intensity distributions 
that is of interest. 

One possible approach is to describe the intensity distribution as a weighted linear combination of basis 
functions, tp n (r). These functions could, for example, be radial basis functions, wavelets, or delta functions at 
sample points. If chosen carefully, these functions need not correspond to a single region, but could represent a 
number of spatially separated regions that one wishes to image simultaneously. If we characterise the space of 
intensity distributions according to 

7(r)=^a„Vn(r), (50) 

n 

then the powers recorded at the output of the phased array become 



Z pp / Oj n F pn: (51) 



where 



The frame condition then reads 



F pn = f k p (v)^ n {v)d 2 v. (52) 

J A 



A < a T F T Fa < B VaeC", (53) 

where A and B, and hence the tightness of the frame, can be determined by finding the eigenvalues of F T F, or 
the SVD of F. In the case where the basis functions correspond to sample points r„, we have V'n(r) = S(r — r„) 
and F pn — k p (r n ). Clearly, the original intensity distribution can be found, to within the degrees of freedom N, 
by using the dual vectors of k p (r), namely k p (r), defined in the space of VVi(r). Usually, however, for stochastic 
sources, and when noise is included, a Bayesian method would be used to recover images. 

It is instructive to see how this form of analysis compares with, and is applicable to, multimode bolometric 
imaging arrays. 12 It has been 13 shown that the expectation value, E[P], of the output of essentially any 
multimode bolometric detector is given by 

E[P] = — / + / /" I(r,r»0f(r,r»(i 2 r(( 2 r'(L, (54) 
2?r J_ 00 J a J a 

where T(r, r', u) is a tensor that characterises completely the physics of the detector, and can include any optical 
system and filters that precede the detector. denotes the full tensor contraction to a single real variable, and 



lu indicates the frequency dependence of the tensors. If we now assume a completely unpolarised, incoherent 
source, as described by (47), then the output of the detector becomes 



E[P] = — / I(r,uj)k(r,Lu)d 2 rdu; 



^ p + oo p 



(55) 



J -oo J A 



where fc(r, uS) is the sum of the diagonal elements of T(r, r', uS) evaluated at a single position. In other words, it 
gives the output of a multimode bolometric detector as a function of position. (55) has precisely the same form 
as (49), and therefore one can use, as before, the theory of frames to determine the degree to which an array of 
multimode bolometric detectors creates a frame with respect to a given class of intensity distributions. In fact, 
a multimode bolometric detector can be created from a phased array by measuring the power arriving at each 
output port, multiplying each of the measurements by some weighting factor, and then adding all of the results 
together. This comparison will become important when we come to consider interferometric phased arrays, and 
the fluctuations and correlations that appear at the outputs of phased arrays. 

To finish this section, it is important to stress that the above analysis applies only when the source is fully 
spatially incoherent; it does not apply to recovering the intensity distribution of a field that is partially coherent; 
in that case, the results of the previous section should be used. Because the source must be fully incoherent, the 
analysis applies to primary sources, although the critical point is that the coherence length must be smaller than 
the interval over which the reception patterns change appreciably. Thus the analysis is appropriate for many 
practical situations, but is not applicable, for example, in the case of recovering the intensity in the focal plane 
of a low throughput optical system. 



We now consider the behaviour of phased arrays in the context of interferometry. By 'interferometric phased 
array' we mean any interferometer where the individual elements are equipped with phased arrays for the purpose 
of creating a number of primary beams on the sky simultaneously. Central to the analysis is the observation 
that an interferometric phased array is essentially a bolometric interferometer, 11 where the individual phased 
arrays are associated with a number of natural modes, which are equivalent to the natural modes of a multimode 
bolometer. 

Suppose that a number of telescopes configured as an interferometer, and that each telescope is equipped with 
a phased array. We know that each port of the beam forming network is associated with a synthesised reception 
pattern on the sky, but equally, we recognise that, in general, these synthesised beams are not orthogonal. In 
this context, we have already described the mapping T : |x) i — ► |z) as T : H — > £ 2 . The phased array acts as 
a linear operator between two Hilbert spaces: one being the space of square integrable functions over the input 
reference surface, and one being the space of square summable complex sequences. For any real system, this 
operator must be Hilbert Schmidt as the amount of information that can be transmitted is finite. The integral 
operator can be written 



which is the equivalent of (31), allowing for the fact that the output is a discrete vector. 

The operation of a phased array can therefore be regarded as first mapping the incoming field onto the input 
eigenfields, Vj (r) , which are orthogonal, scaling by the singular values, Ui , and then reconstructing the complex 
travelling wave amplitudes at the output through the basis vectors U, (p) , which are also orthogonal. Those input 
eigenfields associated with non-zero singular values span the field distributions at the input to which the phased 
array is sensitive, and those output eigenfields associated with non-zero singular values span the vectors at the 
output to which the phased array can couple. Moreover, it can be shown the the input eigenfields associated with 
different telescopes are mutually orthogonal, and therefore the eigenfields of different telescopes can be combined 
to form a single large, orthonormal, composite basis set that can be used to propagate any partially coherent 
field through a complete interferometer. The input eigenfields actually describe those field distributions that can 
be traced to the output ports and then back again onto the sky unchanged in form; they are the eigenfunctions 
of complete round trips. 



6. INTERFEROMETRIC PHASED ARRAYS 




(56) 



The analysis of an interferometric phased array proceeds as follows. Calculate the Hilbert-Schmidt decom- 
position of each telescope, and pick out those eigenfields having non-zero singular values above the threshold, 
e, of interest. Place phase slopes on the eigenfields in accordance with the baselines of the interferometer. This 
procedure has already been described in detail in the context of bolometric interferometers, and will not be 
repeated here. 14 ' 15 The elements of the correlation matrix describing the correlations between the different 
output ports of the phased arrays on different telescopes then become 

V=EE^' U »W U *'W/ / V:(r).I(r',r).V,(r')d 2 rdV, (57) 

■ ~, J A J A 

where the sums over the eigenfields i and i' extend to all telescopes. In the case where the source is spatially 
incoherent and unpolarised, (57) becomes 

Z vv> = £5>i*i'Ui(p)Uj,(p') / /(r)V*(r) • VAr')d 2 r. (58) 
i v Ja 

Z contains complete information about the correlations between the outputs of phased arrays on the same and 
different telescopes in terms of the intensity, I(r), of the field on the sky, and can be used to produce simulated 
fringes. 14, 15 

Given that the combined set of input eigenfields of all antennas spans completely the fields on the sky to which 
an interferometer is sensitive, it would also be straightforward to determine whether a given set of baselines and 
phased arrays comprise a frame with respect to some class of intensity distributions. In reality, the Fourier plane 
is rarely sampled completely, and in any case the calculation of the frame bounds would be computationally 
intensive. 



7. NOISE 

We have described a scheme for analysing the behaviour of phased arrays and interferometric phased arrays, and 
it would be desirable to include noise. Ideally, one should be able to model any internally generated noise by 
using the synthesised reception patterns alone. Also, we need to determine not just the noise power appearing 
at the outputs of an array, but the fluctuations and correlations in the fluctuations in the power arriving at the 
output ports. After all, it is the fluctuations that determine the sensitivity with which a measurement can be 
made. 

If the receiver noise temperatures associated with the primary horns are known, and equal, then the procedure 
is straightforward. By definition, the noise temperature of a receiving channel is the temperature that a matched 
source would need to have in order to generate the same output power as a noiseless, but otherwise identical, 
system. 

Using (20) in matrix form, the correlations between the outputs of a phased array are given by 

Z = R[Z' + Z^ v ]R t , (59) 

where Z' N is the correlation matrix of an equivalent set of noise sources at the input, one for each synthesised 
beam, which are incoherent with respect to the true source Z'. Strictly speaking, (59) assumes that all of the 
noise is in the same spatial modes as the signal, which of course need not be true. (59) can be extended easily 
to account for the more general case, but we shall not do so here. 

To find Z' N we simply need to project a uniform background source having an intensity that is equal to the 
noise temperature onto the synthesised beams; using (21) we get 

Z' N , PP >= I T„t;(r)-V(r)d 2 r. (60) 

J A 

The diagonal terms of Z' N pp , give the noise temperatures that must be associated with each of the synthesised 
beams, and importantly, the off-diagonal terms give the correlations between them. If the beams are orthogonal, 
the noise sources are uncorrelated, and one returns to the original definition of noise temperature. 



8. CORRELATIONS AND FLUCTUATIONS 



The net outcome of the previous sections is the ability to calculate the correlations that appear at the output ports 
of a phased array, or the output ports of phased arrays on different telescopes, from knowledge of the synthesised, 
possibly non-orthogonal and linearly dependent, reception patterns. Once this information is known, many 
measurable quantities follow; including average powers, fluctuations in power, field correlations, and fluctuations 
in power correlations between different ports. Moreover, these matrices contain the Hanbury Brown- Twiss 
correlations associated with phased arrays. The expressions that follow have been derived previously, 6 ' 13 ' 16 ' 17 
but will be reproduced here for completeness. 

Rather than simply determining the output powers of individual ports, it is more general to consider detectors 
that measure the powers at a number of ports simultaneously according to some weighting vector. Characterise 
each weighted combination of detectors by diagonal matrix W G C PxP , where the diagonal elements are the 
factors that weight the sensitivities of the individual detectors that are connected to the phased array. Under 
these circumstances, and assuming radiation whose intrinsic reciprocal coherence time is much greater than the 
bandwidth of the system Au>, which is valid for radio astronomy systems, it can be shown that the expectation 
value of the power E[P], recorded by a detector combination is given by 



Likewise, assuming Gaussian statistics, the fluctuations in the output C ss and the correlations between the 
fluctuations of two outputs C st are given by 



where all quantities are allowed to be a function of frequency The only restriction on (62) is that the post- 
detection integration time r >> 1/Aw where Z(o>) — ► for frequencies outside of Aw, which is necessary for all 
astronomical instruments. 

These expressions can be extended to describe the quantum mechanical behaviour of phased arrays, as has 
been done for bolometric interferometers. 17 The bolometric interferometer model did not, however, include the 
Poisson limit for low photon occupancies. 20 In this volume, 18 we describe how one can add a single term to 
(62) to create a statistical mixture that includes the Poisson noise of photon counting. Once the additional term 
is included, it is possible to take into account the transition from fully Poisson to fully bunched behaviour, as the 
photon occupancies of the incoming modes increase, as one moves from infrared to submillimetre wavelengths. 
It is entirely possible, therefore, to modal the quantum-statistical behaviour of phased arrays at all wavelengths. 

Expressions (61) and (62) offer a further possibility of considerable importance. Clearly, we have a numerical 
procedure for determining the expectation values and the covariances of the powers that arrive at the output 
ports of an imaging, or intcrfcromctric, phased array when a source is present. A discretised version of the model 
has already been published. 6 The model takes into account noise, and can be extended easily to include quantum 
effects. Also, the beam patterns do not have to be orthogonal. (61) and (62) therefore make it straightforward 
to set up a likelihood function for the outputs that would be recorded when some class of source is observed. 
Obviously the likelihood function would contain the signal, its fluctuations, and any instrumental noise, including 
quantum effects, as well as the Hanbury Brown- Twiss correlations between pixels. The source may be as simple 
as a single incoherent Gaussian on the sky, or if one is trying to design a phased array that can observe two 
different regions of the sky simultaneously, it could be two highly separated Gaussians. Any other parameterised 
source distribution could be used; for example, Cauchy functions are often used in astronomy to parameterise 
Sunyaev-Zel'dovich emission from clusters of galaxies, and in (62), we used a Gauss-Schcll source as a convenient 
way of paramctcrising general partially coherent fields. 

On the basis of the likelihood functions, one could then derive numerically, the Fisher information matrix, 18, 19 
from which the covariance matrix of the source parameter estimators could be found. We have already started 




(61) 




(62) 



to apply this technique, in a completely different context, for understanding the design of bolomctric imaging 
arrays: see Saklatvala, 18 in this volume. In order words, one could determine the minimum errors, and the 
confidence contours, that could be achieved when determining the parameters of sources. Exploring how these 
errors change as the design of a phased array changes, say by packing more and more overlapping beams into a 
finite region, would be of considerable interest, and the result should be related, in some way, to the effectiveness 
with which the array forms a frame with respect to the incoming field distributions, or intensity distributions, 
of interest. 

9. CONCLUSION 

We have studied the functional behaviour of imaging phased arrays and interferometric phased arrays, and shown 
that their operation is closely related to the mathematical theory of frames. In order to calculate the behaviour of 
an imaging phased array, or an interferometric phased array, it is only necessary to know the synthesised reception 
patterns, which may be non-orthogonal and linearly dependent. It is not necessary to know anything about the 
internal construction of the array itself. As a consequence, data can be taken from experimental measurements 
or from electromagnetic simulations. The theory of frames allows one to assess, in a straightforward manner, 
whether the outputs of a phased array contain sufficient information to allow a field or intensity distribution to 
be reconstructed in an unambiguous way. 

Our model also allows straightforward calculation of quantities such as the correlations in the fluctuations 
at the output ports of phased arrays. The theory of interferometric phased arrays is almost identical to the 
theory of multimode bolometric interferometers, and therefore, recently developed techniques for modelling 
bolomctric interferometers can be applied to phased arrays also: including quantum statistics. The work opens 
up the important possibility of constructing likelihood functions that enable the covariance matrices of source- 
parameter estimators to be determined. Thus, for example, one could explore the possibility of enhancing source 
reconstruction by packing in more and more overlapping synthesised beams into a region, or widely separated 
regions, of finite size. Any enhancement of the accuracy with which source parameters can be recovered, will be 
related, and to some extent determined, by the degree to which the beam patterns of the array form a frame 
with respect to all possible incoming field distributions. 

In a later paper, we shall use the concepts described here, and the numerical techniques reported previously, 
to simulate and assess the behaviour of interferometric phased arrays when different optical systems and beam- 
forming networks are used. 

APPENDIX A. DERIVING AN EXPRESSION FOR THE TRAVELLING WAVES AT 

THE OUTPUTS OF A PHASED ARRAY 

Suppose that some field, |x), is incident on a phased array, we can represent a measurement of the amplitude 
and phase of the travelling wave z p at p, relative to a normalised reference signal z' p , by the inner product 
(z' p \z) e 2 = z*z p , where \z.' p ) is a vector corresponding to a measurement at port p alone. For example, the 
measurement could be carried out by homodyne mixing the travelling wave z p with a reference oscillator z' p 
at p, and then low-frequency filtering the result. Introducing the linear operator T, leads to a measurement 
of (Zp|Tx)£2, which by definition of the adjoint, can be written (T^z p |x)e- In other words the inner product 
between |x) and the field distribution represented by T^Zp) gives the same result as the measurement, but now 

the inner product is evaluated at the input reference surface. We shall call iT^Zp) the synthesised reception 
pattern of port p. The canonical inner product in H takes the form 

(TV p |x>H= / z;*t;(r)-x(r)d 2 r, (63) 

J A 

where t p (r) is the functional form of the synthesised reception pattern, because the result must be equal to z*z p , 
and therefore conjugate linear in z' p . In (63), the integral over A corresponds to the input reference surface, and 



extends over the region associated with Hilbert space EL Finally, because (63) must be equal to z*z p , we have 
an expression that relates the complex amplitude of the travelling wave at p to the incident field: 

z p = [ t;(r)-x(r)d 2 r, (64) 

J A 

It is clear from (64) that the synthcsiscd reception pattern is the complex conjugate of what would be measured 
in an experiment where a point source is swept over the input surface. The key point is that (64) is valid even 
when the beams are not orthogonal. 
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